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ABSTRACT 

The globular cluster 47 Tucanae is well studied but it has many characteristics 
that are unexplained, including a significant rise in the velocity dispersion profile 
at large radii, indicating the exciting possibility of two distinct kinematic pop- 
ulations. In this Letter we employ a Bayesian approach to the analysis of the 
largest available spectral dataset of 47 Tucanae to determine whether this appar- 
ently two-component population is real. Assuming the two models were equally 
likely before taking the data into account, we find that the evidence favours the 
two-component population model by a factor of ~ 3 x 10 7 . Several possible expla- 
nations for this result are explored, namely the evaporation of low-mass stars, a 
hierarchical merger, extant remnants of two initially segregated populations, and 
multiple star formation epochs. We find the most compelling explanation for the 
two-component velocity distribution is that 47 Tuc formed as two separate pop- 
ulations arising from the same proto-cluster cloud which merged < 7.3 ± 1.5 Gyr 
ago. This may also explain the extreme rotation, low mass-to-light ratio and 
mixed stellar populations of this cluster. 

Subject headings: globular clusters: individual (47 Tucanae) 



1. Introduction 

As one of the closest and most massive Gala ctic globular cluster s (GCs), 47 Tucanae (47 



Tuc) is a test-bed for Galaxy formation models ( iSalaris et al.l 120071 . and references therein) 
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distance measurement techniques (e.g. 


Bono et al. 


2008) 


McWilliam & Bernstein 




2008 




Lane et 


al. 




2009b|). 


This 



(e.g. 



left s everal unresolved conundrums. While not unique in this respect (see iGratton et al. 



2004 . for a review of elemental abundance variatio ns in GCs), 47 Tuc h as a bimodal dis- 
tribution of carbon and nitrogen line strengths (e.g. lHarbeck et al.l 120031 ) . Furthermore, 47 
Tuc has a complex stellar population, exhibiting, for example, multiple sub-giant branches 



ve-g- 



Anderson et al.ll2009l ). Although multiple Red Giant and Horizont al Branches can also 



be found in ot her GCs, and may be due to chemical anomalies (e.g. iFerraro et al.l 12009 



Lee et al.ll2009i ). 47 Tuc is particularly unusual in many respects. It h as an extreme rota- 
tional velocity (a property it shares only w ith M22 and u Centauri, e.g. iMerritt et al.l 11997 



Anderson Sz Kin g 2003; lLane et al. 



pi 

2 



persion profile at large radii (lLane et al. 



2009 



j) and an apparently unique rise in its velocity dis- 



2009b|). 



Furthermore, the mass-to-light ratio (M/Ly) of 47 Tuc is very low for its mass (lLane et al 



20101 ) . that is, it does not obey the mass-M/Ly relation described by iRejkuba et al.l ( 120071 ) . 
Note that this mass- M/Ly relation is not due to the presence of dark matter but because 
of dynamical effects ( iKruijssenl 120081 ). Explanations for these unusual properties may be 
intimately linked to its evolutionary history. In this Letter we describe and analyse various 
ex planations for the ri se in velocity dispersion in the outer regions of 47 Tuc initially reported 
bv lLane et aD (l2009bf ). 



Lane et al.l ( j2009bl ) provided a complete description of the data acquisition and reduc- 
tion, radial velocity uncertainty estimates, the membership selection process, and statistical 
analysis of cluster membership for all data presented in this Letter. 



Plummer Model Fits 



The iPlummerl (Il91ll ) model (see also lDejonghelll987l ) predicts that the isotropic, pro- 
jected velocity dispersion a falls off with radius r as: 



a(r; {a ,r }) 



1 + 



1/4 : 



1 



where <7o is the central velocity dispersion and r is the scale radius of the cluster. We now 
describe how we fitted Plummer profiles to the 47 Tuc radial velocity data to infer the values 
of the parameters cr and r$, and also to evaluate the overall appropriateness of the Plummer 
hypothesis (Hi) by comparing it to a more complex double Plummer model (H 2 ). 



The mechanism for carrying out this comparison is Bayesian model selection (jSivia fe Skilling 



20061 ). Suppose we have two (or more) competing hypotheses, Hi and H 2 , with each possibly 



-3 - 



containing different parameters 9\ and 9 2 . We wish to judge the plausibility of these two 
hypotheses in the light of some data D, and some prior information. Bayes' rule provides 
the means to update our plausibilities of these two models, to take into account the data D: 

P(H 2 \D) P(H 2 )P(D\H 2 ) 
PiH^D) P(Hi) P(D\Hi) 
P(H 2 ) J ei p(e 1 \H l )p(D\0uH 1 )d6 1 
" P(ifx) X f d2 p(e 2 \H 2 )p(D\6 2 ,H 2 )d6 2 [ ] 

Thus, the ratio of the posterior probabilities for the two models depends on the ratio of 
the prior probabilities and the ratio of the evidence values. The latter measure how well 
the models predict the observed data, not just at the best-fit values of the parameters, but 
averaged over all plausible values of the parameters. It should be noted that we rely solely 
on the velocity information for our Plummer model fits. Taking the stellar density as a 
function of radius into account would be useful in further constraining the models. However, 
the Plummer model fits by Lane et al. (in prep.) based exclusively on velocity information 
are also good fits to the surface brightness profiles of the four GCs analysed in that study. 



2.1. Single Plummer Model 

The data are a vector of radial velocity measurements v = {v 1 ,v 2 , ...,i>jv} of N stars, 
along with the corresponding distances from the cluster centre r = {ri, r 2 , rjv} and ob- 
servational uncertainties on the velocities a ^ s = {<7 bs,i, c bs,iv}- We will consider v to 
be the data, whereas r and (x b s are considered part of the prior information. In this case 
the probability distribution for the data given the parameters is the product of independent 
Gaussians, whose standard deviations vary with radius: 

p(v|/x,cr ,ro) = 

nfe-(-K^) 2 ))' 

where \i is the systemic velocity of the cluster. The standard deviation Oi for each data point 
is given by a combination of the standard deviation predicted by the Plummer model, and 
the observational uncertainty: 



°i = \J^Wo,r }) 2 + a 2 ohsi (4) 

To carry out Bayesian Inference, prior distributions for the parameters must also be defined. 
We assigned a uniform prior for ji (between —30 and 30 km s" 1 ). For a , we assigned 
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Jeffreys' scale-invariant prior p(o"o) oc l/oo for uq in the range 0.1 — 100 km s~ . Finally, ro 
was assigned the Jeffreys' prior p(r ) oc l/r for ro in the range 0.2 — 220 pc. These three 
prior distributions were all chosen to be independent and to cover the approximate range of 
values that we expect the parameters to take. 



2.2. Double Plummer Model 

The double Plummer model is a simple extension of the Plummer model. The stars are 
hypothesised to come from two distinct populations, each having its own Plummer profile 
parameters (but with a common systemic velocity /x). Thus, at any radius r from the cluster 
centre, we model the velocity distributi on as a mixture (weighted sum) of two Gaussians. 



From previous work (lLane et al.ll2009bl ) we also expect the inner regions of the cluster to 
be well fitted by a single Plummer profile, so the weight of the second population of stars 
should become more significant at larger radii. 

Thus, instead of having a single <7o parameter and a single ro parameter, there are now 
two of each. The probability distribution for the data given the parameters is then the 
weighted sum of two Gaussians: 

P(v|/x, {a },{r },w(r)) = 

N 



n 



exp 




+ ^Mexp l-l[2—H) I I. (5) 



Here, w(r) is a weight function that determines the relative strength of one Plummer profile 
with respect to the other, as a function of radius. We expect one component to dominate 
at smaller radii, and to eventually fade away as the second component becomes dominant. 
Hence, we parameterise the function w(r) as: 

. . exp (u(r)) , , 

w(r) = - PV ; 6 

v ' l + exp(u(r)) v 1 



where 



u(r) = u a + — — (u p - u a ) . (7) 



That is, the log of the relative weight between one Plummer component and the other 
increases linearly over the range of radii spanned by the data, starting at u a and ending at a 
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value up. Parameterising w via u makes it easier to enforce the condition that w(r) must be 
in the range — 1 for all r. The prior distributions for u a and up were chosen to be Gaussian 
with mean zero and standard deviation 3. This implies that w(r) will probably lie between 
0.05 and 0.95, with a small but not negligible chance of extending lower than 0.001 or above 
0.999. 

The standard deviations of the two Gaussians at each data point are given by a combi- 
nation of that predicted by the Plummer model and the observational uncertainty: 

°V = ^/a(ri;{(x ,ro}i) 2 + or2 bS)i (8) 

"»|2 = \/v(r i ;{a ,r } 2 ) 2 + cr2 hsi (9) 

The priors for all the parameters /i, {<Jo,r } were chosen to be the same as in the single 
Plummer case. 



Results 



An obvious rise in the velocity dispersion of 47 Tuc was described by lLane et al.l (l2009bl . 
their Figure 1 1) at approximately half the tidal radius (~ 28 pc). The tidal radius is ~ 56 pc 
(IHarrisI 119961 ) . To confirm the reality of this rise, several tests were performed, inc luding 
resizing the bins and shifting the bin centres, as described by lLane et al.l (l2009bl ). No 
difference in the overall shape of the dispersion profile was found during any tests. 



We used a variant (IBrewer et al.l 120091 ) of Nested Sampling (jSkillingl 120061 ) to sample 
the posterior distributions for the parameters, and to calculate the evidence values for the 
single and double Plummer models. The results are listed in Table HJ The double Plummer 
model is favoured by a factor ~ 3 x 10 7 , and consists of a dominant Plummer profile that 
fits the inner parts of the radial velocity data (Figure [1]), and a second, wider profile that 
models the stars at large radius. 

As a test of the veracity of our model we altered the model so that w(r) was linear (with 
endpoints in the range — 1 and with a uniform prior) rather than u(r) being linear. This 
had the effect of reducing the log evidence to ~ —7748, so in this case the double Plummer 
model is favoured by a factor ~ 10 5 . This best-fit model has a more subtle increase in width 
at large radii, when compared with Figure [TJ Presumably this is because w(r) being linear 
prevents more rapid fade-outs. Correspondingly, the Plummer profile represented by the 
thin curve in Figured was shifted slightly lower. Note that the model in Figure [1] is narrower 
than the spread of the data, because the spread also arises partly from observational errors. 
The most extreme points at large radii are likely to be those for which the intrinsic velocity 
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dispersion is large and the observational errors have pushed the points further away from 
the mean. In Figure El the Plummer profiles of the two population co mponents ar e shown . 
The inner component is a good fit to the binned velocity dispersions by lLane et al.l ( )2009bl ). 
We now discuss possible explanations for this two-component population, and calculate an 
upper limit on when the second component was introduced. 



3.1. Evaporation 



Drukier et al.l ( 120071 ) carried out iV-body simulations of GCs through core-collapse and 
into post-core-collapse. They showed that the evaporation of low-mass stars due to two-body 
interactions during these phases alters the velocity dispersion profiles in predictable ways. 
At approximately half the tidal radius (r*/2) the velocity dispersion reaches a minimum of 
~ 40% of the central dispersion, then rises to ~ 60% of th e central level at r* . These criteria 
are certainly met w ithin 47 Tuc (again, see Figure 11 of lLane et al.ll2009bl ). Furthermore, 
Lane et al.l (J2009bJ) conclude that the rise in velocity dispersion could b e explained by evap- 
oration due to the core collapsing, and the Fokker-Planck models by iBehler et al.l (120031 ) 
show that 47 Tuc is nearing core-collapse. 

This appears to be reas onable evidence that 47 Tuc is evap o rating . However, based 
on the conclusions drawn by iDrukier et al.l (120071 ) and lLane et al.l (120101 ) , it is unclear how 
much Galactic tidal fields affect the outer regions of Galactic globular clusters, and what 
effect this has on the external velocity dispersion profil e. While our b e st-fit double Plummer 
model matches the overall form of the trend shown by IDrukier et al.l (120071 ) reasonably well 
(see Figured]), this scenario does not explain its multiple stellar pop ulations (although these 
might be explained by chemical anomalies, e.g. iPiotto et al.ll2007l , and references t herein), 
nor its low M/Ly in comparison with its mass (see Figure 6 of lLane et al.l I201CH ) . or its 
extreme rotation. In addition, the extra-tidal sta rs are spread unifo rmly across all regions 
of the colour magnitude diagram (see Figure 3 of lLane et al.l l2009bl ) . which is inconsistent 
with the the two-component population being a consequence of evaporating low-mass stars. 
We cannot, however, completely discount evaporation without detailed chemical abundance 
information. 



3.2. Merger 



Another scenario, which appears to explain most of the unusual properties of 47 Tuc, 
is that it has undergone a merger in its past (note that this is not the first evidence for 
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Table 1: Inferred parameter values for the single Plummer and the double Plummer fits 
to the 47 Tuc data. The values quoted are the posterior mean ± the posterior standard 
deviation, when the marginal posterior distributions were sufficiently symmetric for this to 
be a reasonable summary. For the few parameters with asymmetric posterior distributions, 
we instead give the symmetric 68% credible interval. The evidence values imply that if 
the two models were equally likely before taking into account the data, the data makes the 
double Plummer model more likely by a factor of e 17 ' 3 ~ 3 x 10 7 . For the double Plummer 
model, the first value listed is for the component that dominates at r = 0. 



Parameter 


Value 


Single Plummer Profile 


n 


-16.87 ± 0.17 km s" 1 




9.37 ± 0.32 km s" 1 




9.27 ± 0.98 pc 


log(evidence) 


-7759.5 


Double Plummer Profile 


n 


-16.94 ± 0.12 km s" 1 




9.93 ± 0.43 km s" 1 




[5.70, 13.51] ± km s" 1 




6.76 ± 0.94 pc 




[53.6, 4560] pc 


u a 


6.30 ± 1.30 


Up 


-2.30 ± 1.12 


log(evidence) 


-7742.2 
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Fig. 1. — The radial velocities of stars in 47 Tuc together with the best fit double Plummer 
model for the velocity distribution as a function of radius. The inner part of the cluster is 
well modelled by the dominant Plummer profile, while at larger radii, the second plummer 
profile dominates. The radius at which the two profiles have equal weight is 55 pc. 




30 40 50 60 

Distance from Cluster Centre (pc) 



Fig. 2. — Binned velocity dispersion as a function of radius (from lLane et al.l l2009bl ) . with 
the radial velocity profiles of the two stellar populations from the best fit double Plummer 
model. The Plummer profile that dominates at small radii is shown as the thick black curve, 
while the thin curve shows the Plummer profile for the stellar population that dominates at 
large radii. 
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such a merger within Galactic GCs, see iFerraro et al.ll2002l ). Several observed quantities 
can be explained by this hypothesis: (1) the bimodality of the carbon and nitrogen line 
strengths, (2) the mixed stellar populations, (3) the large rotational velocity, (4) the low 
M/Ly compared with total mass and (5) the increase in velocity dispersion in the outskirts 
of the cluster. 



In addition to our evidence for two kinematically distinct stellar populations, lAnderson et al 



(120091 ) showed that 47 Tuc also has two distinct sub-giant branches, one of which is much 
broader than the other, as well as a broad main sequence. The authors determined that 
this broadening may be d ue to a combination of metallicities and ages, and it is known 
from previous studies (e.g. lHarbeck et al.l 120031 ) that 47 Tuc has a bimodality in its carbon 



and nitrogen line strengths. While this bimodality is not unique, a merger could explain 
its origin. Therefore, another possible scenario, which explains many of the properties of 47 
Tuc, is a past merger event. 

While it might seem unlikely that this is the remnant of a merger, extant ki nematic sig- 



natur es of subgalactic scale hierarchical merging do exist (e.g. within u Centauri; lFerraro et al. 



20021 ). and there have been hints th at the distinct photom etric populations in 47 Tuc might 



be remnants of a past merger (e.g. lAnderson et al.l 120091 ) . A possible explanation for this 



merger hypothesis is given in Section 13.31 



3.3. Initial Formation 



The two components may have formed at the same epoch, and the distinct kinematic 
populations are, therefore, e xtant remnants from th e formation of the cluster itself. If GCs 
form from a single cloud (see lKalirai fc Richer! 120091 for a discussion of GCs as simple stellar 
populations), it is possible for the proto-cluster cloud to initially contain two overdensities 
undergoing star formation independently at almost the same time. In this case, the two 
proto-clusters, which would inevitably be in mutual orbit due to the initially bound nature 
of the proto-cluster cloud, eventually coalesced through the loss of angular momentum due 
to dynamical friction. 



Note that this scenario is similar to the capture of a satellite described by IFerraro et al. 



(120021 ) to explain the merger hypothesis for oj Centauri, and explains the two kinematic 
and photometric populations, and the high rotation rate assuming the proto-cluster cloud 
initially had a large angular momentum. It might also explain the low M/Ly of this cluster. 



Interestingly, IVesperini et al.l (120091 ) show that clusters with initially segregated masses 
evolve more slowly than non-segregated clusters, having looser cores and reaching core- 
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colla pse much later. Be cause the core of 47 Tuc is highly concentrated and near core-collapse 
(e.g. iBehler et al.l 120031 ) it must be very old if it was initially mass-seg regated. 47 Tuc is 
thought to be 11-14 Gyr old (e.g. 
segregation is plausible. However 



Gratton et al.ll2003l ; iKaluzny et al.ll2007l ). hence initial mass 
even if 47 Tuc is ~ 11 Gyr old the original populations 
would be kinematically indistinguishable at the present epoch (Section I3.5P indicating that 
some other process was the cause of the two extant populations des cribed in this Letter . 
Furthermore, Milky Way GC formation ended about 10.8 Gyr ago (e.g. iGratton et al.ll2003l ). 
long before the upper limit for the initial mixing of the two populations derived in Section 

E2J 



3.4. Multiple Star Formation Epochs 



Two star formation epochs in GCs result in the radial separatio n of the two popula- 



tions, with the second generation initially concentrated in the core (e.g. ID'Ercole et al.ll2008 



and references therein). Furthermore, the kinematics of the second generation are virtu- 
ally independent of that of the first generation and the second generation contain chemical 
ano malies which are cons i stent with having ar isen in the envelopes of the first generation 



(.e.g. 



Decressin et al.l 120071 : ID'Ercole et al.ll2008l ). 



This scenario might explain the two kinematic populations and chemical anomalies of 
47 Tuc, however, it is unclear how this would cause the extreme rotation, or the anomalous 

M/Ly 



3.5. Time-scale for the Initial Mixing of the Second Population 



The scenarios described in Sections 



and 13.41 require a second populat i on be 



ginning to mix with an initial population at a particular epoch. iDecressin et al.l (120081 ) 
performed detailed iV-body simulations of globular clusters containing two distinct popu- 
lations of stars to determine their dynamical mixing time via two-body relaxation. They 
concluded that ~ 2 relaxation times are required to completely homogenise the populations, 
a timescale that is virtually independent of the number of stars in the cluster. They also con- 
cluded that the information on the stellar orbital angular momenta of the two populations 
is lost on a similar timescale. Since we observe two distinct, extant kinematic populations, 
an upper limit can be placed on when the two populations began to mix. 



Decressin et al.l (120081 ) showed that the relaxation time (t r h) of a GC decreases by ~ 
0.29t r h for each consecutive relaxation time, i.e. t r h(i) ~ 0.71t r h(i + 1). Because the two 
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populations are kinematica lly distinct a t the present epoch, and the current relaxation time 
of 47 Tuc is t r h w 3.02 Gyr (lHarrislll996l ). an upper limit on when the two populations began 
to mix is 7.3 ± 1.5 Gyr ago. Note that the scenarios discussed in Sections 13.21 and 1 3.31 can 
be temporally assessed in this way only if the merger did not disrupt the core of the large r 
component. This is true for minor mergers in blue compact dwarf galaxies (ISung et al.ll20 02). 



there fore, if we assume a merger between objects with a mass ratio of 9:1 ({Anderson et al. 



20091 . show the ratio of the number densities on the two sub-giant branches of 47 Tuc is 



j 9:l), this is also likely to hold for 47 Tuc. 



4. Conclusions 

With a Bayesian analysis of the velocity distribution of 47 Tuc, we conclude that the 
scenario which best explains the observed properties of 47 Tuc is that we are seeing the 
first kinematic evidence of a merger in 47 Tuc, which occurred < 7.3 ±1.5 Gyr ago. Extant 
kinematic populations from the merger of formation remnants is a plausible explanation as 
to the reason for this merger, assuming the two components evolved separately and merged 
< 7.3 ± 1.5 Gyr ago. This scenario could explain the two-component population, its extreme 
rotational velocity, mixed stellar populations and low M/Ly compared with its mass. 

All the other explanations for this two-component population are less plausible than the 
merger hypothesis. Evaporation of low mass stars is unlikely due to the various stellar types 
that are found beyond the tidal radius, and this also cannot explain its low M/Ly. The 
possibility of multiple star formation epochs does not explain the large rotational velocity, 
nor the low M/Ly. 

Detailed chemical abundances and high resolution A^-body simulations of merging glob- 
ular clusters are now required to further analyse the merger scenario. Several observed 
quantities need to be addressed, namely how much angular momentum can be imparted 
through a 9:1 merger, what consequence it has on the velocity dispersion in the outer re- 
gions of the cluster over dynamical timescales, and what effect it would have on the global 

M/Ly 

Of course, alternative explanations exist for the observed rise in dispersion. For example, 
if GCs form in a similar fashion to Ultra Compact Dwarf g alaxies, there may be a larg e 



quantity of DM in the outskirts of the cluster as discussed by lBaumgardt fc Mieskei (120081 ). 
However, no evidence exists supporting GCs fori 
to have significant dark matter components (e.g 



Lane et al. 


2009a 


b, 


2010) 
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